A Novel Optimized Language-Independent Text Summarization Technique

نویسندگان

چکیده

A substantial amount of textual data is present electronically in several languages. These texts directed the gear to information redundancy. It essential remove this redundancy and decrease reading time these data. Therefore, we need a computerized text summarization technique extract relevant from group documents with correlated subjects. This paper proposes language-independent extractive technique. The proposed presents clustering-based optimization clustering determines main subjects text, while minimizes redundancy, maximizes significance. Experiments are devised evaluated using BillSum dataset for English language, MLSUM German Russian Mawdoo3 Arabic language. experiments ROUGE metrics. results showed effectiveness compared other language-dependent techniques. Our achieved better metrics all utilized datasets. accomplished an F-measure 41.9% Rouge-1, 18.7% Rouge-2, 39.4% Rouge-3, 16.8% Rouge-4 on average three objectives. system also exhibited improvement 26.6%, 35.5%, 34.65%, 31.54% w.r.t. recent model contributed terms metric evaluation. model’s performance higher than models, especially ROUGE_2 which bi-gram matching.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Language-independent Techniques for Automated Text Summarization

Text summarization is the process of distilling the most important information from source/sources to produce an abridged version for a particular user/users and task/tasks. Automatically generated summaries can significantly reduce the information overload on intelligence analysts in their daily work. Moreover, automated text summarization can be utilized for automated classification and filte...

متن کامل

A language independent approach to multilingual text summarization

This paper describes an efficient algorithm for language independent generic extractive summarization for single document. The algorithm is based on structural and statistical (rather than semantic) factors. Through evaluations performed on a single-document summarization for English, Hindi, Gujarati and Urdu documents, we show that the method performs equally well regardless of the language. T...

متن کامل

A survey on Automatic Text Summarization

Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...

متن کامل

Language Independent Summarization Approaches

In this chapter, the authors introduce monolingual and multilingual summarization and present the problem of dependence of language and linguistic knowledge of the process. Then they describe the most influential works and techniques in the field of automatic multilingual and language-independent summarization. This section is presented as a solution to solve the problem already explained. The ...

متن کامل

Language Independent Extractive Summarization

We demonstrate TextRank – a system for unsupervised extractive summarization that relies on the application of iterative graphbased ranking algorithms to graphs encoding the cohesive structure of a text. An important characteristic of the system is that it does not rely on any language-specific knowledge resources or any manually constructed training data, and thus it is highly portable to new ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computers, materials & continua

سال: 2022

ISSN: ['1546-2218', '1546-2226']

DOI: https://doi.org/10.32604/cmc.2022.031485